ACACCATTTG TCTTCATGTA ACCCCATTAG CTATACCCTC TAGTGCAAGG AAACCATAGG 
10 20 30 40 50 60 

GCCTAGGTCA CACCATGAGG CTGCNCTTAC AAGTTATGCA AAAACTATGG ACTTGGGAGA 
70 80 90 100 110 120 

CCTGTGCGTA ACAACATCAC ACNCCAAATT TAACCAGCTC TCCCCATAAC AGCACGCTCA 
130 140 150 160 170 180 

TGTGTTACTG AGGAAATGCC TGTGGATTGG AGTGTGTTCT GTGTGCAGGA GGCTGGTCCA 
190 200 210 220 230 240 

GGTTTCACTT CTGCAGGACA CTGGACGTTT CCCAAAACCA GCAGACTTTC CCCACGTGCA 
250 260 270 280 290 300 

CACACACCCC TTCTCATTTT GCCTCTACAT CCATATCCAC TGGGCCCTTC AGGCACCTAC 
310 320 330 340 350 360 

TAATGCCCTA GAACCTAAAA CCATCATCTG GGGCCCAGTT CCCTGAATGG CCCTAATCTC 
370 380 390 400 410 420 

TTCCTCTGCT GGAATGAGTC CAGTGCCCAC TTCCTCCAAC GGTGAAATTG CTGGGCTGCT 
430 440 450 460 470 480 

ACAGATCAGG AACTCACTGC TTCCTCATAG GGGCAGCCGA CTTCACTGCT CTGCAACAGC 
490 500 510 520 530 540 

GACCACCCCT AGCGAGGCTT GAGATGCCTC TTGCCTCCTT AAGACTGAGG GAGACGCTTC 
550 560 570 580 590 600 

AGCTCTCACT CCACTGCCCC AAGTCCTCCA CAGCGCGGTG CCTGCTGCCT TCACACAGAG 
610 620 630 640 650 660 

CTGCAGGGGN AGGTCCTGTG TATCCGGCCT GCTGGACCAG CGCTGTGCAC AACCCTCCCA 
670 680 690 700 710 720 

TGGCAACAGT GGCTGCCCGG CCTGCACACT GGGCTTGGCA ACCTCGCTGT AGGTATTTAT 
730 740 750 760 770 780 

TCCCTCAGGA GTGACTGCAT TCTTTTCCCA TTTCCAGAAA ACTGATGCCA TTTACCTCAC 
790 800 810 820 830 840 

TATGAGGAGG AGGAGGAGGA GGAGGGTGGA GAGTGGTACA TTTTAAAATG TGCACTATTC 
850 860 870 880 890 900 

TCCCTAGGAC TCCCCCTCAA ATAACCCAGG AGGGACCATA CCAGCTCATT CCTGTGTATC 
910 920 930 940 950 960 

CCAAGCATAN GAGTAATCAT CCCACTCATG CTGAGTGTAT GGTGGCCATT AAGCCTGCCC 
970 980 990 1000 1010 1020 

TGAACTGGCT TTAGAACAAG GTGTTTGAGC ACACAGCACC GTCTTGCTGC CACCTTGGCC 
1030 1040 1050 1060 1070 1080 

CCCTCCCTTG TGAGACCTCT GAGACACATT NAGGTCTCAC CTAAAAATCT CAGGATTTCT 
1090 1100 1110 1120 1130 1140 

AGGCCCAAAN CGGTCCTAAA AAATTGTTCA GTCTGAACTC TCTAAGGTCA AGAGAAGAGG 
1150 1160 1170 1180 1190 1200 

TGGTTGCTCC CTCTAAGAAA CCACATGTTG CATGTACATC CTTAATTCCG GAAAGTCCAA 
1210 1220 1230 1240 1250 1260 

CAAACCTGCC CTGCTTAGCA ACACAAGCCG AGGTGGTACT CCTCTCACCC GGGCATTCTC 
1270 1280 1290 1300 1310 1320 

CAACACACCT GTTTGTCCAA ACAGCTTTGA TTTGTTTTTA TAGTTGGACC CCAGGTTCCC 
1330 1340 1350 1360 1370 1380 

AGGAGGCTGG TTCAGGCCAT ATTCCAAATC CTCATCTGTG TGTGAGTGGC ATTCTTAGCC 
1390 1400 1410 1420 1430 1440 

TAGCCTCCTT ACAGGGTGGA TACTATGATA CACAGCCAGG CTGTCCCAGT GGCTTTCAAT 
1450 1460 1470 1480 1490 1500 

ATTCTTTTGG TCCAGATAGT TCAGCCTCAG CACCAGTGTA GGCATCACAG GGTCAATTGT 
1510 1520 1530 1540 1550 1560 

CTTAGGAGTC ATGGAGAATT CATAGTTGGT AGCTACCTGG GCCTGGCCAG GGCTGACCAT 
1570 1580 1590 1600 1610 1620 

AGACAAGGCA TCCCTCTGTG AACTCCTATT TTAATGCCAG CTTCCCAACA AATTTCTCAA 
1630 1640 1650 1660 1670 1680 

CAAT box 

CTGCTCTTAC CAGCAGGTAT TTAAACTACT CAATAGAAAG TAACCCTGAA AATTAGGACA 
1690 1700 1710 1720 1730 1740 

TATA box 



CCTGTTCCCA AAAGACCCTT AAATAGGGGA AGTCCTTTCN CTGCTTGTGC ACAGCTGCTG 
1750 1760 1770 1780 1790 1800 

| - >mRNA 

ATGTGGCAAC ATGAGGCCTG GGACAGGGGA CTGTCCTCTG CCCACTCTGG TAGCCTCACG 
1810 1820 1830 1840 1850 1860 

Spsite 

-- exon 1 >###### 

TAGCTTAACA ATCTGTCAGT AATACAATAC AAAACTTAAA CTTTCATACT GCGGTTCCAC 
1870 1880 1890 1900 1910 1920 

CCAGGAAGCT GTGTTCCCAA TCTGACCCGT GATTATGGGG CCACCTCAGA GGGNACCCAG 
1930 1940 1950 1960 1970 1980 

TGAGGGAATA TTTTGCCATC TGGGACTGTT GGTTGCTGGG GGCAGTGGCT ATGAGCTCAG 
1990 2000 2010 2020 2030 2040 

TTAATAAACT CAAGCAGTTT CCTTCCAAAC ACACATGTCC TACTTAACGT GTCCAACAGA 
2050 2060 2070 2080 2090 2100 



GATGATCATA CTCATANGCT GCTAAAACAT TANTTTTATT TTGAGAAAAG TCTATTCATG 
2110 2120 2130 2140 2150 2160 
Alu insert 

TTCTTGGCCC ATGGAGTTTT CATTTNATTA NTTTATTTAT TTTGCAGAGA TGGAGTCTCA 
2170 2180 2190 2200 2210 2220 

CTATGTTGCT CAAGCTGGTC TCCAACTCCT GGGCTCAAGC GATCTTCCTA CTTTGGCCTT 
2230 2240 2250 2260 2270 2280 

TGAAAGCGCT GAGATTGCCT GTGTGAGCCA TCATGGGGGC TCACTGGCCC ACTGATTAAT 
2290 2300 2310 2320 2330 2340 

CAGATTAATT GTTTTTTGCT ATTGAANTTG TTTGACTTCC TTGTATATTC GGATATTTAC 
2350 2360 2370 2380 2390 2400 

CCATTCTAAC ACGTAGGGTT TGCAAATATT TTCTCTCATG TTCTGTGTTG CCTTTTCACT 
2410 2420 2430 2440 2450 2460 

CAGTTGATGG TTTCCTTTGC TGTGCAGGTG CTTTAGTGTT CAACGCAGCC CCGCTTGTCT 
2470 2480 2490 2500 2510 2520 

ATTTTCCATT TTATTGCCTG TCCCTTTGAT GTCATAGCCA AGAAATAATT GCCCAGATTA 
2530 2540 2550 2560 2570 2580 

ATGTCAAAAA GCTTTATCCC TATATATTCT TCTAGTAGTT TATGGTTTCA GATCTTATGT 
2590 2600 2610 2620 2630 2640 

TTAGGTCTTC AATCCATTGA GTTGATTTTT GTATGTGGTA TAAGAAAAAA GACCACATGT 
2650 2660 2670 2680 2690 2700 

ATACATATCT CAAATTCTAA GGTAGTATAT ATTAGACACA TACAATGTGT CTATTTACAC 
2710 2720 2730 2740 2750 2760 

ACATTGAGCT GAAAATAATA AACATATTTT TATCTTTCAA TCAACTCTAT CTCTATCTCA 
2770 2780 2790 2800 2810 2820 

CTGAACTTGT TTCACCTATA GCCTGATGAG GTTGCTGTCC TCTCTACCCC AGCTCCTATA 
2830 2840 2850 2860 2870 2880 

GGAGACTGCT CATCCCCTAA CCTCAAAAAC CCCTTCATGA GGGTGATAAT GCCCTTGAAT 
2890 2900 2910 2920 2930 2940 



CCTGCAATGA ATTAGTTCTC TACTACAGTG GAATTCAGGT CTGTTATGAG GGTCTGGATC 
2950 2960 2970 2980 2990 3000 

TCTGAAGAGA AGAGCTCTCA TTTTCAGAAA ATAAGCAGGA TTTATTCCCT GAAATTACTG 
3010 3020 3030 3040 3050 3060 

AATTAAATCA CTGTTTCGAT TACTTTTTGC AATATTAAAA GTAAATATTT AAACAGGTAA 
3070 3080 3090 3100 3110 3120 

AAACAGAAAT AATGGTAGGG TCCTTATCAT CACCGTGAAT TCCAAGCTAG CATAGACACT 
3130 3140 3150 3160 3170 3180 

AAACCTAGAG ATTCACACTA GAATGAAAGC TGGGAGAGCA GAGGAGTCTC AGAAGGATGT 
3190 3200 3210 3220 3230 3240 

GGAGGCCAAT GGACACCTGC AACCTCTCCA ACGAAATGCC TACCTCCTCT CACTGCAGCA 
3250 3260 3270 3280 3290 3300 



TCCATCTCTG AGCCTTCTCG CAGCAGAGCT ATAAATTCAG CCTGGCTCCT CCGTTCCCAC 
3310 3320 3330 3340 3350 3360 



Spsite CDS start 

###< *** 

ACATCCACTC CTGCTCTCCC TCCTCTCCTC CAGGTGACTA CAGTTATGAG GACCCTCACC 
3370 3380 3390 3400 3410 3420 

Exon 2 

CTCCTCTCTG CCTTTCTCCT GGTGGCCCTT CAGGCCTGGG CAGAGCCGCT CCAGGCAAGA 
3430 3440 3450 3460 3470 3480 

GCTCATGAGA TGCCAGCCCA GAAGCAGCCT CCAGCAGATG ACCAGGATGT GGTCATTTAC 
3490 3500 3510 3520 3530 3540 



Spsite 

>### ### 

TTTTCAGGAG ATGACAGCTG CTCTCTTCAG GTTCCAGGTG AGAGATGCCA GCATGCAGAG 
3550 3560 3570 3580 3590 3600 

CTACAGACTA GACAGAAGGA CAGGAGACAG GCTCTGGAAT TGGATCTCAG TGGCAGATGT 
3610 3620 3630 3640 3650 3660 

CACTTAGGTG GCTATACTTA ACATCTCTGG TCCTGGATTT TCTCATATCT AAATGGAATA 
3670 3680 3690 3700 3710 3720 

GAGAACCAAA GAAATCTAAG AGATTTTTCT TTCTCCAAAA ACTTGATTCC AAGATATGAC 
3730 3740 3750 3760 3770 3780 

TGTGAAATTC ACTAGATTTA AGATATAAGG AGATGCTACC TAGTTCCTTC TGGAGCCAGA 
3790 3800 3810 3820 3830 3840 
CAAACAAGCT TAAGTATATA GGAAAATATT TCACCCTGTC TATATAGGAG GTTTTAGAAC 
3850 3860 3870 3880 3890 3900 

CTGGAGAGGA GCCTAAGAAT GTGTTCAGGT GTGTGTGTGA TGGGCAGGAA TGCAGAAAAG 
3910 3920 3930 3940 3950 3960 

TGAAGCAAAG GAGAATGAGT CTCGAATCCT GTGTGACCAG CACTGCTCTG TGTATTTATT 
3970 3980 3990 4000 4010 4020 

CCTATTGACT GAGATTGTTT GTGCTACCGG CTGTAATACA GCCAACATCA CTCATCAGCC 
4030 4040 4050 4060 4070 4080 

AACATGTGAC TTCTCCAAGA TTCCCTTTAC CACCCACTGC TGNACCCCGT ACTCAGTTTC 
4090 4100 4110 4120 4130 4140 






Spsite 

###< 

TGATGCTCTC TCTGGGTCCC CAGGCTCAAC AAAGGGCTTG ATCTGCCATT GCAGAGTACT 
4150 4160 4170 4180 4190 4200 

Exon 3 

ATACTGCATT TTTGGAGAAC ATCTTGGTGG GACCTGCTTC ATCCTTGGTG AACGCTACCC 
4210 4220 4230 4240 4250 4260 



AATCTGCTGC TACTAAGCTT GCAGACTAGA GAAAAAGAGT TCATAATTTT CTTTGAGCAT 
4270 4280 4290 4300 4310 4320 



TAAAGGGAAT TGTTATTCTT ATACCTTGTC CTCGATTTCC TGTCCTCATC CCAAATAAAT 
4330 4340 4350 4360 4370 4380 



ACTTGGTAAC ATGATTTCCG GGTTTTTTTT TTTTT 
4390 4400 4410 
DEF4 GGATCCCCATTTGTCTTCAGTGTAACCC-ATTAGTTAAACCGCCTACTGCAAGGAAACCA 

DEFX ACACCATTTGTCTTCA - TGTAACCCCATTAGCTATACCCTCTAGTGCAAGGAAACCA 

60 70 80 90 100 110 

DEF4 CAAGGCTTGGATCAGATCATGAGGCTGCCCT-ACAAGTTATGCCAAAAAATATGGACTTG 

DEFX TAGGGCCTAGGTCACACCATGAGGCTGCNCTTACAAGTTATGC - AAAAACTATGGACTTG 
60 70 80 90 100 110 

120 130 140 150 160 170 

DEF4 GAAGACCTGTCTGTTATAATATCACAC - CCAAATCTAACCAGCTCTGCCAATAACAGCTC 

DEFX GGAGACCTGTGCGTAACAACATCACACNCCAAATTTAACCAGCTCTCCCCATAACAGCAC 
120 130 140 150 160 170 






DEF4 



DEFX GCTCATGTGTTACTGAGGAAATGCCTGTGGATTGGAGTGTGTTCTGTGTGCAGGAGGCTG 
180 190 200 210 220 230 

240 250 260 270 280 290 

DEF4 GTCCAGGTTTCACTTCTGCAGGACACTGGACATC - CCCACAACCACCAGACCTTCCCCAC 



DEFX GTCCAGGTTTCACTTCTGCAGGACACTGGACGTTTCCCAAAACCAGCAGACTTTCCCCAC 
240 250 260 270 280 290 

300 310 320 330 340 350 

DEF4 GTGCACACACACCCCTTCTCATTTTGCCTCTACATCCATATCCACTGGGCCCTTCAGGCA 

DEFX GTGCACACACACCCCTTCTCATTTTGCCTCTACATCCATATCCACTGGGCCCTTCAGGCA 
300 310 320 330 340 350 

360 370 380 390 400 410 

DEF4 CCTACTAATGCCCTAGAACCTAAAACCATCATCTGGGGCCCAGTTCCCCAAATAGCCCTA 

DEFX CCTACTAATGCCCTAGAACCTAAAACCATCATCTGGGGCCCAGTTCCCTGAATGGCCCTA 
360 370 380 390 400 410 

420 430 440 450 460 470 

DEF4 ATTTCTTCCTCTGCTGGAATGAGTCCAGTGCCCACTTCCTCCAAAGGTGAAATTGCTGGG 

DEFX ATCTCTTCCTCTGCTGGAATGAGTCCAGTGCCCACTTCCTCCAACGGTGAAATTGCTGGG 
420 430 440 450 460 470 



480 490 500 510 520 530 

DEF4 CCTGCAACAGATCAGGAACTCACTGCTTC - TCATAGGGGCAGCCGACTTCACTGCTCTGG 

DEFX C-TGCTACAGATCAGGAACTCACTGCTTCCTCATAGGGGCAGCCGACTTCACTGCTCTGC 
480 490 500 510 520 530 



540 550 560 570 580 590 

DEF4 AACAGCGACCACCCCTAGCGAGGCTTGAGATGCCTCTTCCCTCCTTAAGACTGAGAGCGC 

DEFX AACAGCGACCACCCCTAGCGAGGCTTGAGATGCCTCTTGCCTCCTTAAGACTGAGGGAGA 
540 550 560 570 580 590 

600 610 620 630 

DEF4 CGCT - GCCCCCAGTCCTCCATAGCCCAGTGCCTGGCTGCCTTCA 

DEFX CGCTTCAGCTCTCACTCCACTGCCCCAAGTCCTCCACAGCGCGGTGCCTG - CTGCCTTCA 
600 610 620 630 640 650 

640 650 660 670 680 690 

DEF4 GCCAGAGCTGCAGGGG-AGGCCCTGAGCACCCAAGTCCTGCTGGACCAGCGCTGTGCACG 

DEFX CACAGAGCTGCAGGGGNAGGTCCTGTGTATCC - -GGCCTGCTGGACCAGCGCTGTGCACA 
660 670 680 690 700 710 

700 710 720 730 740 750 

DEF4 GCCCTCCCATGGCGGCAGGGGCTGCCTGGACTGCATACTGGGTTCAGCAACCTCACTATA 

DEFX ACCCTCCCATGGCAACAGTGGCTGCCCGGCCTGCACACTGGGCTTGGCAACCTCGCTGTA 
720 730 740 750 760 770 

760 770 780 790 800 810 

DEF4 GGTATTCATTCCCTCAGGAACAACTGCATTCTTTTCTCATTTCCAGAAACCTCATCCCGT 

DEFX GGTATTTATTCCCTCAGGAGTGACTGCATTCTTTTCCCATTTCCAGAAAACTGATGCCAT 
780 790 800 810 820 830 

820 830 840 850 860 

DEF4 TTACCTCACTACAAGGAGGAGGATG---------GTGGAGAGTGGTACATTTTAAAATGT 

DEFX TTACCTCACTATGAGGAGGAGGAGGAGGAGGAGGGTGGAGAGTGGTACATTTTAAAATGT 
840 850 860 870 880 890 

870 880 890 900 910 920 

DEF4 GCACTAGTCTCCCTGGGACTCCCCTTCAAATAACCCAGGAGGGACCACACAAGGGAAAGC 

DEFX GCACTATTCTCCCTAGGACTCCCCCTCAAATAACCCAGGAGGGACCATACCAGCTCATTC 
900 910 920 930 940 950 

930 940 950 960 970 980 

DEF4 TTATGCATCCCCCCCACCC - AGTGACCATCTTCCTAACTCTGGGTGTAGGGAGACTCGTA 

DEFX CTGTGTATCCCAAGCATANGAGTAATCATCCCACTCATGCTGAGTGTATGGTGGCCATTA 
960 970 980 990 1000 1010 

990 1000 1010 1020 1030 1040 

DEF4 AGCCTACG - - GGATTGGTTTGGGAACAGGGTATTTGAGCTCACAACACAAGGTGATGCAA 

DEFX AGCCTGCCCTGAACTGGCTTTAGAACAAGGTGTTTGAGCACACAGCACCG- 
1020 1030 1040 1050 1060 
1020 1030 1040 1050 1060 
1050 1060 1070 1080 1090 1100 

DEF4 GCTAACACCAATCTCGCTGCAGCTTTGGCCACCATCCTAAGG - GACTTCTGACAGACATT 

DEFX 

1110 1120 1130 1140 1150 1160 

DEF4 - AGGTGTCACGCAATCATTTGATGAGTCCTTGGCCTGGAT - -GACCTAGACAGTCATTTA 

DEFX NAGGTCTCACCTAAAAATCTCAGGATTTCTAGGCCCAAANCGGTCCTAAAAAATTGTTCA 
1120 1130 1140 1150 1160 1170 






1170 1180 1190 1200 1210 1220 

DEF4 GGCTTGAACTATCTAAGGCCAAGCAAAAAGGTGACTGTCCCCTCTAGGAA - CCACATGCT 



DEFX GTCT - GAACTCTCTAAGGTCAAGAGAAGAGGTGGTTGCTCCCTCTAAGAAACCACATGTT 
1180 1190 1200 1210 1220 



1230 1240 1250 1260 1270 

DEF4 ATATGCACATCCTTTACTCGGGAGCCTGCAAC CTGCCCTATCCAGCAACACAAGCC 



DEFX GCATGTACATCCTTAATTCCGGAAAGTCCAACAAACCTGCCCTGCTTAGCAACACAAGCC 
1230 1240 1250 1260 1270 1280 



1280 1290 1300 1310 1320 1330 

DEF4 CAGGCG-TATTCAGTCTCATCCAGGTATTCTCCAAC---CTTACTTGTCTGAATGGCTTG 



DEFX GAGGTGGTACTCC - TCTCACCCGGGCATTCTCCAACACACCTGTTTGTCCAAACAGCTTT 
1290 1300 1310 1320 1330 1340 



1340 1350 1360 1370 1380 1390 

DEF4 GATTTGTTTTTATGGTTAGACCCCAGGG - CCTGGGAGGTCAGTTCAGACCACATTCCAAA 

DEFX GATTTGTTTTTATAGTTGGACCCCAGGTTCCCAGGAGGCTGGTTCAGGCCATATTCCAAA 
1350 1360 1370 1380 1390 1400 

1400 1410 1420 1430 1440 1450 

DEF4 TCCTCATCTGTGTGTGGGTGGCATTTTGATCCTAGTCTCCTCGCAAGGTGTATACAACAA 

DEFX 

1460 1470 1480 1490 1500 1510 

DEF4 TATGCAGGCCAGGCTCTCCTGGTGGCTTTAAATATTCCCTCGGTCCAGGTAGTTCAGCCT 

DEFX TACACAG - CCAGGCTGTCCCAGTGGCTTTCAATATTCTTTTGGTCCAGATAGTTCAGCCT 
1470 1480 1490 1500 1510 1520 



1520 1530 1540 1550 1560 1570 

DEF4 - CAGCCACCAGCATAGGTATCATGGGGTCAATTGTCTTAGGAGTCATGAGGAATCCACAGT 



DEFX CAGC-ACCAGTGTAGGCATCACAGGGTCAATTGTCTTAGGAGTCATGGAGAATTCATAGT 
1530 1540 1550 1560 1570 1580 

1580 1590 1600 1610 1620 1630 

DEF4 TGATTGCTGCCTGGGCCTGGCCAGGGCTGACCAAAGTAGACGAGGGGTCGGTACCTCCGT 



1590 1600 1610 1620 1630 

1640 1650 1660 1670 1680 1690 

DEF4 GGACTCCTGCTTGAACTCCAGCTTTCTGCCAAATTTCTCAACTGCCCTTGTTAACAGTTA 



DEFX GAACTCCTATTTTAATGCCAGCTTCCCAACAAATTTCTCAACTGCTCTTACCAGCAGGTA 
1640 1650 1660 1670 1680 1690 

CAAT box 

1700 1710 1720 1730 1740 1750 

DEF4 TTTAAAGTACCCAATAGAAAGTAACGCTGAAAAATTAGGACACCTGATACCAAAAGACCC 

DEFX TTTAAACTACTCAATAGAAAGTAACCCTGAAAA - TTAGGACACCTGTTCCCAAAAGACCC 
1700 1710 1720 1730 1740 1750 

TATA box 

1770 1780 1790 1800 

DEF4 TTAAATAAGG- AAGTCCTCTC - CTCTGTGTGCATGGCTGCTCTTG CTACATAAGACC 



DEFX TTAAATAGGGGAAGTCCTTTCNCTGCTTGTGCACAGCTGCTGATGTGGCAACATGAGGCC 
1760 1770 1780 1790 1800 1810 

mRNA start --> SpSite 

1810 1820 | 1830 1840 1850 1860 

DEF4 TGGAACACAGGACTGCTGTCTGCCCTCTCTGCTCGCCCTGCCTAGCTTGAGGATCTGTAA 

DEFX TGGGACAGGGGACTGTCCTCTGCCCACTCTGGTAGCCTCACGTAGCTTAACAATCTGTCA 
1820 1830 1840 1850 1860 1870 

1880 1890 1900 1910 1920 

DEF4 GTAACACAA AACTTAAACTTTCACATTGAGGTTTCAATATTGAAGCTGTGTCCCC 

:::= :=== :::::::::::::: : := :::: :: :::::::::: ::: 

DEFX GTAATACAATACAAAACTTAAACTTTCATACTGCGGTTCCACCCAGGAAGCTGTGTTCCC 

1880 1890 1900 1910 1920 1930 

1930 1940 1950 1960 1970 1980 

DEF4 AGTCTGACCTCTCACTGTGGGGCCACCCCAGAGGACCCAGCGTGAAGCCCCTGCTGTGAA 

DEFX AATCTGACCCGTGATTATGGGGCCACCTCAGAGGGNACCCAGTGAGGGAA - TATTTTG - - 
1940 1950 1960 1970 1980 1990 

1990 2000 2010 2020 2030 2040 

DEF4 CTTCTATCTGGGTGTCTGGCGGCTGCTGGGGGTAATGGCTACTAGCTAAGTCAATAGAGA 



DEFX - - - CCATCTGGGA - -CTGTTGGTTGCTGGGGGCAGTGGCTATGAGCTCAGTTAATA- 
2000 2010 2020 2030 2040 
2050 2060 2070 2080 2090 2100 

DEF4 AACTCAAAAAGTTTCCTTCCAAACACACGTGTCCTACTTGACATGTCCAATAAAGACGAT 

DEFX AACTCAAGCAGTTTCCTTCCAAACACACATGTCCTACTTAACGTGTCCAACAGAGATGAT 
2050 2060 2070 2080 2090 2100 

2110 2120 2130 2140 

DEF4 - CAGCTTCT - - TAAAACATTA- TTTTATTGTGAGAGAAGCCTCT - 

DEFX CATACTCATANGCTGCTAAAACATTANTTTTATTTTGAGAAAAGTCTATTCATGTTCTTG 
2110 2120 2130 2140 2150 2160 

DEF4 

DEFX GCCCATGGAGTTTTCATTTNATTANTTTATTTATTTTGCAGAGATGGAGTCTCACTATGT 
2170 2180 2190 2200 2210 2220 

2160 

DEF4 GTTTTTC- 

DEFX TGCTCAAGCTGGTCTCCAACTCCTGGGCTCAAGCGATCTTCCTACTTTGGCCTTTGAAAG 
2230 2240 2250 2260 2270 2280 



DEF4 

DEFX CGCTGAGATTGCCTGTGTGAGCCATCATGGGGGCTCACTGGCCCACTGATTAATCAGATT 
2290 2300 2310 2320 2330 2340 

2200 2210 2220 2230 

DEF4 - GTTGTTTGACTTCCTTATGTATTCAGATATTTACCCCTTC 

DEFX AATTGTTTTTTGCTATTGAANTTGTTTGACTTCCTTGTATATTCGGATATTTACCCATTC 
2350 2360 2370 2380 2390 2400 

2270 2280 2290 

DEF4 IATTTTCTGGGTTGCCGTTTCCCTCAGTTG 

2410 2420 2430 2440 2450 2460 

2300 2310 2320 2330 2340 2350 

DEF4 ATTGTTTCCTTTGCTATGAAGATGCTTTAGCGTTCAATGCAGCCCCGCTTGTCTATTTTC 



DEFX ATGGTTTCCTTTGCTGTGCAGGTGCTTTAGTGTTCAACGCAGCCCCGCTTGTCTATTTTC 
2470 2480 2490 2500 2510 2520 

2360 2370 2380 2390 2400 2410 

DEF4 CCATTTGTTTATTGCCTGTGCCTTTGGTGTCATAGCCAAGAAATCATTACTCACGTCAAT 

DEFX C - ATTT TATTGCCTGTCCCTTTGATGTCATAGCCAAGAAATAATTGCCCAGATTAAT 

2530 2540 2550 2560 2570 2580 
2420 2430 

DEF4 GTCCAAA - GCTTTA 

DEFX GTCAAAAAGCTTTATCCCTATATATTCTTCTAGTAGTT - TATGGTTTCAGATCTT - 
2590 2600 2610 2620 2630 



2480 2490 2500 2510 2520 2530 

DEF4 GTCTATGTTGAG - TCTTCAATCCATGTTGAGCTGATTTTT - TACATGTTGTGAGAGAAAG 

DEFX ATGTTTAGGTCTTCAATCCA- -TTGAGTTGATTTTTGTATGTGGTATAAGAAAAAA 
2640 2650 2660 2670 2680 2690 

2540 

DEF4 GACCACGTGTATGCACCT 

DEFX GACCACATGTATACATATCTCAAATTCTAAGGTAGTATATATTAGACACATACAATGTGT 
2700 2710 2720 2730 2740 2750 

2550 2560 

DEF4 - - -AGC - - - AACTCATGAAC - 

2570 

DEF4 - CTTACA - - CAACTCTTT 

DEFX CTATTTACACACATTGAGCTGAAAATAATAAACATATTTTTATCTTTCAATCAACTCTAT 
2760 2770 2780 2790 2800 2810 

2580 2590 2600 2610 2620 2630 

DEF4 ATCTCTCTCACTGAGCTCATTTCACCTGTACCCTGATAAGGTCATTGTCCTCTTCACTCT 

DEFX CTCTATCTCACTGAACTTGTTTCACCTATAGCCTGATGAGGTTGCTGTCCTCTCTACCCC 
2820 2830 2840 2850 2860 2870 

2640 2650 2660 2670 2680 2690 

DEF4 GGCCCCTACAGGAGACTACTCACCCCATTACCTCAGTCGCCCCTTCATGAGGGT-ATAAT 

DEFX AGCTCCTATAGGAGACTGCTCATCCCCTAACCTCAAAAACCCCTTCATGAGGGTGATAAT 
2880 2890 2900 2910 2920 2930 

2700 2710 2720 2730 2740 2750 

DEF4 GACCTAGAAGCCTGCAATGAGTTACT - CTCTACTCCACCGGAATTCAGGTCTGGCACCAG 

DEFX GCCCTTGAATCCTGCAATGAATTAGTTCTCTACTACAGTGGAATTCAGGTCTGTTATGAG 
2940 2950 2960 2970 2980 2990 

2760 2770 2780 2790 2800 2810 

DEF4 TGTTTAGACCT - - GAAGAGAATAGTAGGGCCCATTATCAGGAAATAAGAGGCATTTGCTC 

DEFX GGTCTGGATCTCTGAAGAGAAGAG---CTCTCATTTTCAGAAAATAAGCAGGATTTATTC 
3000 3010 3020 3030 3040 

2820 2830 2840 2850 2860 2870 

DEF4 TCTTAAATTATTGAATGAAAGCACTGTTTCCATT - CTTTTTAGAATATTAAAGATTTAAC 

DEFX CCTGAAATTACTGAATTAAATCACTGTTTCGATTACTTTTTGCAATATTAAA - 
3050 3060 3070 3080 3090 

2880 2890 2900 2910 2920 2930 

DEF4 CAGGAAATATTAGGTATTTCCTGAAAACAGGAAAAAATGCCAGGGTCCTCATCATCACCA 

DEFX - AGTAAATATTTA - - AACAGGTAAAAACAG - AAATAATGGTAGGGTCCTTATCATCACCG 
3100 3110 3120 3130 3140 3150 

2940 2950 2960 2970 2980 

DEF4 TCAACTTCAACCTAGGCACAGACACTAAACATAGAGCTTC - - - CTGTGAAGAAAGCTGGG 

DEFX TGAATTCCAAGCTAG - CATAGACACTAAACCTAGAGATTCACACTAGAATGAAAGCTGGG 
3160 3170 3180 3190 3200 3210 

2990 3000 3010 3020 3030 3040 

DEF4 AGAGCAGAGGAGGCATTCCAGGGATGTCAAGGCCAATAGGAGTCGGCATCCTCTCTAACA 

DEFX AGAGCAGAGGAGTC - TCAGAAGGATGTGGAGGCCAATGGACACCTGCAACCTCTCCAACG 
3220 3230 3240 3250 3260 3270 

3050 3060 3070 3080 3090 3100 

DEF4 AAATGCACACCTCCTCTCACTCAGAAGGCCAAAGGTTTCTTATCTCTGTGCCTTCTCCCA 

DEFX AAATGCCTACCTCCTCTCACT GC AGCATCC- - ATCTCTGAGCCTTCTCGCA 

3280 3290 3300 3310 3320 



3110 3120 3130 3140 3150 3160 

DEF4 GAA - AGCTATAAATCCAAGCTGGCTTCTCCCTCCCCACACAGCTGCTCCTGCTCTCCCTC 

DEFX GCAGAGCTATAAATTCAGCCTGGCTCCTCCGTTCCCACACATCCACTCCTGCTCTCCCTC 
3330 3340 3350 3360 3370 3380 

< exon2 

3170 3180 3190 3200 3210 3220 

DEF4 CTC CAGGTCACCCCAGCCATGAGGATTATCGCCCTCCTCGCTGCTATTCTCTTGG 

DEFX CTCTCCTCCAGGTGACTACAGTTATGAGGACCCTCACCCTCCTCTCTGCCTTTCTCCTGG 

3390 3400 3410 3420 3430 3440 



3230 3240 3250 3260 3270 3280 

DEF4 TAGCCCTCCAGGTCCGGGCAGGCCCACTCCAGGCAAGAGGTGATGAGGCTCCAGGCCAGG 

DEFX TGGCCCTTCAGGCCTGGGCAGAGCCGCTCCAGGCAAGAGCTCATGAGATGCCAGCCCAGA 
3450 3460 3470 3480 3490 3500 



3290 3300 3310 3320 3330 3340 

DEF4 AGCAGCGTGGGCCAGAAGACCAGGACATATCTATTTCCTTTGCATGGGATAAAAGCTCTG 

DEFX AGCAGCCTCCAGCAGATGACCAGGATGTGGTCATTTACTTTTCAGGAGATGACAGCTGCT 
3510 3520 3530 3540 3550 3560 

3350 3360 3370 3380 3390 3400 

DEF4 CTCTTCAGGTTTCAGGTGAGAGAGGCCAGCATAAAAAAGCTACCGAGTCTAGAGAGACGG 

DEFX CTCTTCAGGTTCCAGGTGAGAGATGCCAGCATGCAGA - GCTAC - - AGACTAGACAGAAGG 
3570 3580 3590 3600 3610 



3410 3420 3430 3440 3450 3460 

DEF4 ATGGGAGATGGGCTCTGGAATCACATCTCAATGGTGGATGTCACTTAGGTGGCTTTACTT 

DEFX ACAGGAGACAGGCTCTGGAATTGGATCTCAGTGGCAGATGTCACTTAGGTGGCTATACTT 
3620 3630 3640 3650 3660 3670 

3470 3480 3490 3500 3510 3520 

DEF4 ACCATCTCTGGGCCTCGATTTTCTTATCTCGAAACTGAATAGAGAGACAAACAAATGTAA 

DEFX AACATCTCTGGTCCTGGATTTTCTCATATCTAAATGGAATAGAGAACCAAAGAAATCTAA 
3680 3690 3700 3710 3720 3730 

DEF4 

DEFX GAGATTTTTCTTTCTCCAAAAACTTGATTCCAAGATATGACTGTGAAATTCACTAGATTT 
3740 3750 3760 3770 3780 3790 



3590 3600 3610 3620 3630 

DEF4 AAGATATGGAGAGACAGATTGACCAGTTCTTTCTGGATCTAAACAAGTA-GAT- -ATTAT 






DEFX AAGATATAAGGAGATG - - CTACCTAGTTCCTTCTGGAGCCAGACAAACAAGCTTAAGTAT 
3800 3810 3820 3830 3840 3850 



DEF4 AGTATGTTCAGGTGTGTGTCTGATGGGGCA--AAAGCACACAAATCAGAGCAAAAGAGAA 



DEFX AATGTGTTCAGGTGTGTGTGTGATGGG-CAGGAATGCAGAAAAGTGA-AGCAAAGGAGAA 
3920 3930 3940 3950 3960 3970 

3760 3770 3780 3790 3800 3810 

DEF4 TGAGTCTCAAATCCTGTATGAGCAGCATTGCTCTGTGTATTTATTCCTATTGACTAAGGT 

DEFX TGAGTCTCGAATCCTGTGTGACCAGCACTGCTCTGTGTATTTATTCCTATTGACTGAGAT 
3980 3990 4000 4010 4020 4030 

3820 3830 3840 3850 3860 3870 

DEF4 TGTTTGTGCTACCGGCACTAATGCAGCCAGCATCACCGGTCAGCCAGCATGTGCATTCTC 

DEFX TGTTTGTGCTACCGGCTGTAATACAGCCAACATCACTCATCAGCCAACATGTGACTTCTC 
4040 4050 4060 4070 4080 4090 

3880 3890 3900 3910 3920 3930 

DEF4 CAAGATTCCCTTTACCACCCACCGCTGACCTTGGTGCTTAATTTCTCAGTCTTCCTCTGT 

DEFX CAAGATTCCCTTTACCACCCACTGCTGNACCCCGTACTCAGTTTCTGATGCTCTCTCTGG 
4100 4110 4120 4130 4140 4150 

< exon3 

3940 3950 3960 3970 3980 3990 

DEFX GTCCCCAGGCTCAACAAAGGGCTTGATCTGCCATTGCAGAGTACTATACTGCATTTTTGG 
4160 4170 4180 4190 4200 4210 

exon3 

4000 4010 4020 4030 4040 4050 

DEF4 AGAACTTCGTGTTGGGAACTGCCTCATTGGTGGTGTGAGTTTCACATACTGCTGCACGCG 

DEFX AGAACATCTTGGTGGGACCTGCTTCATCCTTGGTGAACGCTACCCAATCTGCTGCT 

4220 4230 4240 4250 4260 4270 

exon3 

4060 4070 4080 4090 4100 4110 

DEF4 TGTCGATTAACATTCTGCTGTCCAAGAGAATGTCATGCTGGGAACGCCATCATCGGTGGT 

DEFX ACTAA 

exon3 

4120 4130 4140 4150 4160 4170 

DEF4 GTTAGCTTCACATGCTTCTGCAGCTGAGCTTGCAGAATAGAGAAAAATGAGCTCATAATT 

DEFX - 



4180 4190 4200 4210 4220 4230 

DEF4 TGCTTTGAGAGCTACAGGAAATGGTTGTTTCTCCTATACTTTGTCCTTAACATCTT - TCT 

DEFX TTCTTTGAGCATTAAAGGGAATTGTTATT - - - CTTATACCTTGTCCTCGATTTCCTGTCC 
4310 4320 4330 4340 4350 4360 



4240 4250 4260 4270 4280 4290 

DEF4 TGATCCTAAATATATATCTCGTAACAAGATGTCTTTGTTTACACCTCTTTGAAATTTGAT 

DEFX TCATCCCAAATAAATACTTGGTAACATGATTTCCGGGTTTTTTTTTTTTT 
4370 4380 4390 4400 4410 

10 20 30 40 50 60 

DEF4 GTCTGCCCTCTCTGCTCGCCCTGCCTAGCTTGAGGATCTGTCACCCCAGCCATGAGGATT 

DEFX CTCTGCCCACTCTGGTAGCCTCACGTAGCTTAACAATCTGTGACTACAGTTATGAGGACC 
10 20 30 40 50 60 

70 80 90 100 110 120 

DEF4 ATCGCCCTCCTCGCTGCTATTCTCTTGGTAGCCCTCCAGGTCCGGGCAGGCCCACTCCAG 

DEFX CTCACCCTCCTCTCTGCCTTTCTCCTGGTGGCCCTTCAGGCCTGGGCAGAGCCGCTCCAG 
70 80 90 100 110 120 

130 140 150 160 170 180 

DEF4 GCAAGAGGTGATGAGGCTCCAGGCCAGGAGCAGCGTGGGCCAGAAGACCAGGACATATCT 

DEFX GCAAGAGCTCATGAGATGCCAGCCCAGAAGCAGCCTCCAGCAGATGACCAGGATGTGGTC 
130 140 150 160 170 180 

190 200 210 220 230 240 

DEF4 ATTTCCTTTGCATGGGATAAAAGCTCTGCTCTTCAGGTTTCAGGCTCAACAAGGGGCATG 

DEFX ATTTACTTTTCAGGAGATGACAGCTGCTCTCTTCAGGTTCCAGGCTCAACAAAGGGCTTG 
190 200 210 220 230 240 



DEF4 

DEFX ATCTGCCATTGCAGAGTACTATACTGCATTTTTGGAGAACATCTTGGTGGGACCTGCTTC 
250 260 270 280 290 300 

310 320 330 340 350 360 

DEF4 ATTGGTGGTGTGAGTTTCACATACTGCTGCACGCGTGTCGATTAACGTTCTGCTGTCCAA 

DEFX ATCCTTGGTGAACGCTACCCAATCTGCTG CTACTAA 

310 320 330 340 350 

370 380 390 400 410 420 

DEF4 GAGAATGTCATGCTGGGAACGCCATCATCGGTGGTGTTAGCTTCACATGCTTCTGCAGCT 

DEFX 

360 370 380 390 

430 440 450 460 470 480 

DEF4 GAGCTTGCAGAATAGAGAAAAATGAGCTCATAATTTGCTTTGAGAGCTACAGGAAATGGT 

DEFX --GCTTGCAGACTAGAGAAAAA-GAGTTCATAATTTTCTTTGAGCATTAAAGGGAAT--- 
400 410 420 430 440 450 

490 500 510 520 530 

DEF4 TGTTTCTCCTATACTTTGTCCTTAACATCTT - TCTTGATCCTAAATATATATCTCGTAAC 

DEFX 



DEFX ATG 



Figure 3 

Signal peptide 



MetArgThrLeuThr LeuLeuSerAlaPhe LeuLeuValAlaLeu GlnAlaTrpAlaGlu 



Propiece 

25 30 35 40 

ProLeuGlnAlaArg AlaHisGluMetPro AlaGlnLysGlnPro ProAlaAspAspGln 



AspValValIleTyr PheSerGlyAspAsp SerCysSerLeuGln ValProGlySerThr 



Mature peptide 



LysGlyLeuIleCys HisCysArgValLeu TyrCysIlePheGly GluHisLeuGlyGly 

Mature peptide > 

85 90 94 

ThrCysPheIleLeu GlyGluArgTyrPro IleCysCysTyr 



Figure 4 



SIGNAL PROPIECE 

DEF4_HUMAN [illegible]LLVALQVRA GPLQAR GDEAPGQ-EQRGPEDQDISISFAWDKSS 
DEF5_HUMAN [illegible] ESLQER ADEATTQ-KQSGEDNQDLAISFAGNGLS 
DEF6_HUMAN MRTLTILTAVLLVALQAKA EPLQAEDDPLQAKAYEADAQ-EQRGANDQDFAVSFAEDASS 
DEF1_HUMAN MRTLAILAAILLVALQAQA EPLQAR ADEVAAAPEQIAADIPEWVSLAWDESL 
DEFX MRTLTLLSAFLLVALQAWA EPLQAR AHEMPAQ-KQPPADDQDVVIYFSGDDSC 

PROPIECE Mature PEPTIDE 

DEF4_HUMAN ALQVSGSTRGM VCSCRLVFCRRTELRVGNCLIGGVSFTYCCTRVD 
DEF5_HUMAN ALRTSGSQARA TCYCRTGRCATRESLSGVCEISGRLYRLCCR--- 
DEF6_HUMAN SLRALGSTRAF TCHCRR-SCYSTEYSYGTCTVMGINHRFCCL--- 
DEF1_HUMAN APKHPGSRKNM ACYCRIPACIAGERRYGTCIYQGRLWAFCC 
DEFX SLQVPGSTKGL ICHCRVLYCIFGEHLGGTCFILGERYPICCY--- 



Figure 5 



